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' Abstract 

This paper studies maximum likelihood(ML) decoding in error-correcting codes as ra- 
tional maps and proposes an approximate ML decoding rule by using a Taylor expansion. 
The point for the Taylor expansion, which will be denoted by p in the paper, is properly 
■ chosen by considering some dynamical system properties. We have two results about this 

approximate ML decoding. The first result proves that the order of the first nonlinear terms 
in the Taylor expansion is determined by the minimum distance of its dual code. As the 
second result, we give numerical results on bit error probabilities for the approximate ML 
decoding. These numerical results show better performance than that of BCH codes, and 
indicate that this proposed method approximates the original ML decoding very well. 
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1 Introduction 
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This paper proposes a new perspective to maximum likelihood(ML) decoding in error-correcting 
codes as rational maps and shows some relationships between coding theory and dynamical 
systems. In Section ri.il 11.21 and ll.31 we explain notations and minimum prerequisites of coding 
^ , theory (e.g., see [2]). The main results are presented in Section HDu 

1.1 Communication Systems 

A mathematical model of communication systems in information theory was developed by Shan- 
non [TJ. A general block diagram for visualizing the behavior of such systems is given by Fig- 
ure II. 11 The source transmits a fc-bit message m = (mi ■ ■ ■ irik) to the destination via the 
channel, which is usually affected by noise e. In order to recover the transmitted message 
at the destination under the influence of noise, we transform the message into a codeword 
x = (x\ ■ ■ ■ x n ), n > k, by some injective mapping <p at the encoder and input it to the chan- 
nel. Then the decoder transforms an n-bit received sequence of letters y = (y\ ■ ■ ■ y n ) by some 
decoding mapping ip in order to obtain the transmitted codeword at the destination. Here we 
consider all arithmetic calculations in some finite field and in this paper we fix it as F2 = {0, 1} 
except for Section HJ As a model of channels, we deal with a binary symmetric channel (BSC) 
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in this paper which is characterized by the transition probability e (0 < e < 1/2). Namely, with 
probability 1 — e, the output letter is a faithful replica of the input, and with probability e, it is 
the opposite of the input letter for each bit (see Figure \T7Z\i . In particular, this is an example of 
memory less channels. 

Then, one of the main purposes of coding theory is to develop a good encoder-decoder pair 
(</?, ip) which is robust to noise perturbations. Hence, the problem is how we efficiently use the 
redundancy n > k in this setting. 



Xi = 



(source) Cl) ( ^Encoderj 1^) ^Channel) cz) ( Woder j ( Destination 

m x V x 




Xi = l 



Vi = i 



Figure 1.1: Communication system 



Figure 1.2: Binary symmetric channel 



1.2 Linear Codes 

A code with a linear encoding map (p is called a linear code. A codeword in a linear code can 
be characterized by its generator matrix 



G 



9ln 



(gi ■ ■ ■ 9n) 




. n, 



\ 9kl ■ ■ ■ 9kn 

where each element gij 6 F2. Therefore the set of codewords is given by 

C = {(xi ■ ■ ■ x n ) = (mi ■ ■ ■ m k )G \ mi £ F 2 , }, \C = 2 k . 

Here without loss of generality, we assume gi ^ for all i = 1, ■ ■ ■ , n and rank G = k. We call 
k and n the dimension and the length of the code, respectively. 

Because of the linearity, it is also possible to describe C as a kernel of a matrix H whose 
m = n — k row vectors are linearly independent and orthogonal to those of G, i.e., 



/ h 



C = {(xi 



(xi 



x n )tf* = 0}, H 
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K, 



(hi 



J 



where H means the transpose matrix of H. This matrix H is called a parity check matrix of 
C. The dual code C* of C is defined in such a way that a parity check matrix of C* is given by a 
generator matrix G of C. 

The Hamming distance d(x, y) between two n-bit sequences x, y S FJ? is given by the number 
of positions at which the two sequences differ. The weight of an element x £ Fg is the Hamming 
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distance to 0, i.e., d(x) := d(x, 0). Then the minimum distance d(C) of a code C is defined by 
two different ways as 

d(C) = min{<i(x, y) \ x, y G C and x ^ y} = min{d(x) | ^ x G C}. 

Here the second equality results from the linearity. It is easy to observe that the minimum 
distance is d(C) = d if and only if there exists a set of d linearly dependent column vectors of H 
but no set of d — 1 linearly dependent column vectors. 

For a code C with the minimum distance d = d(C), let us set t := [(d — l)/2j, where [a\ is 
the integer part of a. Then, it follows from the following observation that C can correct t errors: 
if y G and d(x, y) < t for some x G C then x is the only codeword with d(x, y) < t. In this 
sense, the minimum distance is one of the important parameters to measure performance of a 
code and is desirable to design it as large as possible for the robustness to noise. 

1.3 Maximum Likelihood Decoding 

Let us recall that, given a transmitted codeword x, the conditional probability P(y\x) of a 
received sequence y G FJ? at the decoder is given by 

P(y\x) = P{yi\xi) ■ ■ ■ P{y n \x n ) 

for a memoryless channel. Maximum likelihood(ML) decoding tp : FJ? 3 y i-> x G FJ? is given by 
taking the marginalization of P{y\x) for each bit. Precisely speaking, for a received sequence y, 
the i-th bit element Xj of the decoded word x is determined by the following rule: 

( 1, Z*ecP(y\x)<E*ecP(y\x) 

Xi := < xi=o xi=i , i = l,---,n. (1.1J 

[ 0, otherwise 

In general, for a given decoder -ip, the bit error probability P£ it = maxjPf, • • • , P%}, where 
Pf = ~ s xi,xi), x = (xi,--- ,x n ) = 4>(y), 5 a)b = 

x£C,yeF™ 

is one of the important measures of decoding performance. Obviously, it is desirable to design 
an encoding-decoding pair whose bit error probability is as small as possible. It is known that 
ML decoding attains the minimum bit error probability for any encodings under the uniform 
distribution on P(x). In this sense, ML decoding is the best for all decoding rules. However 
its computational cost requires at least 2 k operations, and it is too much to use for practical 
applications. 

From the above property of ML decoding, one of the key motivation of this work comes 
from the following simple question. Is it possible to accurately approximate the ML decoding 
rules with low computational complexity? The main results in this paper give answers to this 
question. 

1.4 Main Results 

Let us first define for each codeword x G C its codeword polynomial F x (ui, ■ ■ ■ , u n ) as 

n 

F x {ui, ■ ■ ■ ,u n ) := Y[pi(ui), Pi{ui) = 
i=i 



j 1, a = b 
{ 0, a + b 



J Uj, Xi — 1, 

\ 1-Ui, Xj = 0. 
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Then we define a rational map / : I n — > I n , I = [0,1], by using codeword polynomials as 

/ : ,u n ) (->• (u[,--- ,u' n ), 

H(u) 

H(u):=J2Fx(u), (1.2) 

where it = (ui, • • • ,u n ). This rational map plays the most important role in the paper. It is 
sometimes denoted by fa, when we need to emphasize the generator matrix G of the code C. 
For a sequence y € Fg, let us take a point u° G I n as 

«*° = { »•=!,■■■,«, (i-3) 

where e is the transition probability of the channel. Then it is straightforward to check that 
Fx( u °) = P{y\ x )- Namely, the conditional probability of y under a codeword x € C is given 
by the value of the corresponding codeword polynomial F x (u) at u = u°. Therefore, from the 
construction of the rational map, ML decoding (jl.ip is equivalently given by the following rule 

</> :F!3»rti6 F5, 

*•=*'»' : = { J: /t»)<!/2' !=1 --'"- (L4) 

In this sense, the study of ML decoding can be treated by analyzing the image of the initial point 
(jl.3p by the rational map (jl.2p . Some of the properties of this map in the sense of dynamical 
systems will be studied in detail in Sectional We will also discuss in Section [5] that performance 
of a code can be explained by these properties. 

For the statement of the main results, we only here mention that this rational map has a 
fixed point p := (1/2, • • • , 1/2) for any generator matrix (Proposition 12, 2j ). Let us denote the 
Taylor expansion at p by 

/(«) = p + Jv + fW(v) + ■■■ + fV\v) + 0(v l+1 ), (1.5) 

where v = (v\ , • • • , v n ) is a vector notation of v% = m — 1 /2, i = 1, ■ ■ ■ , n, J is the Jacobi matrix at 
p, f^'(v) corresponds to the i-th order term, and 0(v l+1 ) means the usual order notation. The 
reason why we choose p as the approximating point is related to the local dynamical property 
at p and will be explained in Section 15.21 

By truncating higher oder terms 0(v l+1 ) in (11.5P and denoting it as 

f( u )= p+Jv + fW( v ) + ... + f(l)( v ) 7 

we can define the l-th approximation of ML decoding by replacing the map f(u) in (|1.4p with 
f(u), and denote this approximate ML decoding by ip : FJ? — > FJJ, i.e., 



1, fi(u°)> 1/2 
0, fi(u°)< 1/2 



Let us remark that the notations / and ip do not explicitly express the dependence on I for 
removing unnecessary confusions of subscripts. 
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1.4.1 Duality Theorem 



We note that there are two different viewpoints on this approximate ML decoding. One way is 
that, in the sense of its precision, it is preferable to have an expansion with large I. On the other 
hand, from the viewpoint of low computational complexity, it is desirable to include many zero 
elements in higher order terms. The next theorem states a sufficient condition to satisfy these 
two requirements. 

Theorem 1 Let I > 2. If any distinct I column vectors of a generator matrix G are linearly 
independent, then the Taylor expansion (jl.5p at p of the rational map (jl.2p takes the following 
form 

/(«)=« + /«)(„) + 0(v l+1 ), 
where the i-th coordinate ff\v) of f( l \v) is given by 

/fW= E C-2) 1 " 1 ^ -«*« = — E (l-2n il )---(l-2n i! ), (1.7) 
®i = {(n> ■ ■ ■ I 1 < h < ■ ■ ■ < k < n, i k ^i (k = !,-■■ ,1), gi + g^A h g k = 0} . 



First of all, it follows that the larger the minimum distance of the dual code C* is, the more 
precise approximation of ML decoding with low computational complexity we have for the code 
C with the generator matrix G. Especially, we can take I = d(C*) — 1. 

Secondly, let us consider the meaning of the approximate map f(u) and its approximate 
ML decoding tp. We note that each value u®(i = 1, • • • , n) in (|1 .3|) for a received word y G 
expresses the likelihood P{yi\xi = 1). Let us suppose 0^ ^ 0. Then, from the definition of u°, 
each term in the sum of (|1.7p satisfies 

4(1 - !K) ■■•(!- 2u H ) { < °' * ^ + • • • + I J , (,•!,...,<!)€ Of. 



2 V JW ' ^ i >0, if y il+ ...+ yi| =1 

When + "' + yi } = 0(= l,resp.), this term decreases (increases, resp.) the value of initial 
likelihood u®. In view of the decoding rule (ll.4p . this induces to be decoded into Xi = 0(= 
l,resp.), and this actually corresponds to the structure of the code gi + gi 1 + • • • + g^ = 
appearing in ®f ■ In this sense, the approximate map f(u) can be regarded as renewing the 
likelihood (under suitable normalizations) based on the code structure, and the approximate 
ML decoding ip judges these renewed data. From this argument, it is easy to see that a received 
word y £ C is decoded into y = ip(y) € C, i.e., the codeword is decoded into itself and, of course, 
this property should be equipped with any decoders. 

We also remark that Theorem [1] can be regarded as a duality theorem in the following sense. 
Let C be a code whose generator (resp. parity check) matrix is G (resp. H). As we explained 
in Section 11.21 the linear independence of the column vectors of H controls the minimum dis- 
tance d(C) and this is an encoding property. On the other hand, Theorem [1] shows that the 
linear independence of the column vectors of G, which determines the dual minimum distance 
d(C*), controls a decoding property of ML decoding in the sense of accuracy and computational 
complexity. Hence, we have the correspondence between H/G duality and encoding/decoding 
duality. In Corollary 14.51 we will consider this duality viewpoint in a setting of geometric Reed- 
Solomon/Goppa codes. 
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1.4.2 Decoding Performance 



We show the second result of this paper about the decoding performances of the approximate 
ML (jl.6p . For this purpose, let us first examine numerical simulations of the bit error probability 
on the BSC with the transition probability e = 0.16. We also show numerical results on BCH 
codes with Berlekamp-Massey decoding for comparison. The results are summarized in Figure 
PI 
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X random + proposed approx. ML (3rd order) 
random + proposed approx. ML (2nd order) 
SK BCH + Berlekamp-Massey 



0.1 0.2 0.3 0.4 

code rate 

Figure 1.3: BSC with e = 0.16. Horizontal axis: 
code rate r = k/n. Vertical axis: bit error prob- 
ability, x: random codes with the 3rd order 
approximate ML decoding. +: random codes 
with the 2nd order approximate ML decoding. 
*: BCH codes with Berlekamp-Massey decod- 
ing. 
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0.1 0.2 0.4 0.6 1 

transition probability 

Figure 1.4: Comparison of decoding perfor- 
mances for a BCH code with k = 7 and n = 63. 
Horizontal axis: transition probability e. Verti- 
cal axis: bit error probability, x: 3rd order ap- 
proximate ML decoding. *: Berlekamp-Massaey 
decoding. □: ML decoding. 



Here, the horizontal axis is the code rate r = k/n, and the vertical axis is the bit error 
probability. The plots + (x resp.) correspond to the 2nd (3rd resp.) order approximate ML 
(|1.6p . and * are the results on several BCH codes (n = 7, 15, 31, 63, 127, 255, 511) with 
Berlekamp-Messey decodings. For the proposed method, we randomly construct a systematic 
generator matrix in such a way that each column except for the systematic part has the same 
weight (i.e. number of non-zero elements) w. To be more specific, the submatrix composed by 
the first k columns of the generator matrix is set to be an identity matrix in order to make the 
code systematic, while the rest of the generator matrix is made up of k x k random matrices 
generated by random permutations of columns of a circulant matrix, whose first column is given 

by 

(V-- i,(V ••())*. 

w 

The reason for using random codings is that we want to investigate average behaviors of the 
decoding performance, and, for this purpose, we do not put unnecessary additional structure 
at encodings. The number of matrices added after the systematic part depends on the code 
rate, and the plot for each code rate corresponds to the best result obtained out of about 100 
realizations of the generator matrix. Also, we have employed w = 2 and 3 for the generator 
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matrices of the 3rd and the 2nd order approximate ML, respectively. Moreover, the length of 
the codewords n are assumed to be up to 512. From Figure [L3| we can see that the proposed 
method with the 3rd order approximate ML (x) achieves better performance than that of BCH 
codes with Berlekamp-Massey(*). It should be also noticed that the decoding performance 
is improved a lot from the 2nd order to the 3rd order approximation. This improvement is 
reasonable because of the meaning of the Taylor expansion. 

Next, let us directly compare the decoding performances among ML, approximate ML (3rd 
order) and Berlekamp-Massey by applying them on the same BCH code (k = 7, n = 63). The 
result on the bit error probability with respect to transition probability is shown in Figure 11.41 
This figure clearly shows that the performance of the 3rd order approximate ML decoding is far 
better than that of Berlekamp-Massey decoding (e.g., improvement of double-digit at e = 0.1). 
Furthermore, it should be noted that the 3rd order approximate ML decoding achieves a very 
close bit error performance to that of ML decoding. Although we have not mathematically 
confirmed the computational complexity of the proposed approach, the computational time 
of the approximate ML (3rd order) is much faster than ML decoding. This fact about low 
computational complexity of the approximate ML is explained as follows: Non-zero higher order 
terms in (|1.7p appear as a result of linear dependent relations of column vectors of G, however, 
linear dependences require high codimentional scenario. Hence, most of the higher order terms 
become zeros. As a result, the computational complexity for the approximate ML, which is 
determined by the number of nonzero terms in the expansion, becomes small. 

In conclusion, these numerical simulations suggest that the 3rd order approximate ML de- 
coding approximates ML decoding very well with low computational complexity. We notice 
that the encodings examined here are random codings. Hence, we can expect to obtain better 
bit error performance by introducing certain structure on encodings suitable to this proposed 
decoding rule, or much more exhaustive search of random codes. One of the possibility will be 
the combination with Theorem [TJ On the other hand, it is also possible to consider suitable 
encoding rules from the viewpoint of dynamical systems via rational maps f 1 1 . 2 j> . This issue is 
discussed in detail in Section [5j In any case, finding suitable encoding structure for the proposed 
decodings is one of the important future problem. 

The paper is organized as follows. In Section [21 we study properties of the rational map 
(11. 2ft in view of a discrete dynamical system and show some relationships to coding theory. We 
also show that this discrete dynamical system is related to a continuous gradient dynamical 
system du/dt = gr ad (log if (it)). Section [3] deals with relationship between a generator matrix 
of a code and its Taylor expansion (|1.5p . The proof of Theorem [H which is a direct consequence 
of Proposition 12.41 and Proposition 13. 11 is shown in this section. In Section HI we apply Theorem 
H]to geometric Reed-Solomon/Goppa codes in Corollary 14. 51 with a simple example by using the 
Hermitian curve. Finally, we discuss the future problems on this subject as an intersection of 
dynamical systems and coding theory. 

2 Rational Maps 

In this section, we discuss ML decoding from dynamical systems viewpoints. Let G be a k x n 
generator matrix for a code C. We begin with showing the following easy consequence of linear 
codes, which will be used frequently throughout the paper. 
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Lemma 2.1 For i G {1, • • • ,n}, let us denote subcodes of C with X{ = and Xi = 1 respectively 
by 

C(xi = 0) = {x G C | = 0} , C(xj = 1) = {x G C | Xi = 1} . 
Then IJCfo = 0) = (JC^ = 1) = 2 fe - 1 . 

Proof. From the assumption on the generator matrix, we have g\ ^ and let / > be the 
number of 1 in gi. Then we have 

l l 
$C( Xi = 0) = i°i x 2 ^< ttC(x 4 = 1) = x Cj x 2 k -\ 

3=0 j=0 
j:even j:odd 

where the symbol iCj means the number of combinations for taking j elements from / elements. 
However these summations of combinations are obviously equal because 

o = (i-iy = j2 lCj (-iy= J2 i c i - E i°r 

j=0 j=0 j=0 

j':even j':odd 

Therefore tyCfa = 0) = \C{xi = 1) = 2 k ~ 1 . I 

Next, let us characterize the codewords in C and the non-codewords in F?? \ C by means of the 
rational map (jl.2p . It should be remarked that a codeword polynomial F x (u) for u = (ui, • • • , u n ) 
with Ui 6 dl = {0, 1}, i = 1, • • • , n, takes its value 



F x (u) 



1, u = x, 
0, u ^ x. 



Here we identify a point u = • • • , u n ) G I ra , Uj G 3/, i = 1, • • • , n, with a point it G F?, by 
a natural inclusion F2 ^ /, and this convention will be used frequently in the paper. We also 
define a point u G I n as a pole of the rational map f)1.2|) if i2"(it) = 0. The boundary and the 
interior of a set A are denoted by dA and Int(-A), respectively. 



Proposition 2.2 XTie followings hold for the rational map (jl.2p .- 

1. p := (1/2, • • • , 1/2) G I n is a fixed point. 

2. Let u G Fg . Then u is a fixed point if and only if u G C. 

3. The set of poles is given by 



S := iu G 8{I n ) 
Especially, ¥%\C CS 



there is no code word x G C with (x^, ■ ■ ■ , x^) = (u^, • • • , u^), 
where it^, ■ ■ ■ ,Ui t G F2, and < Ui < 1, i 7^ ii,- • • ,ii 
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Proof. For the statement 1, let us note that F x {p) = (l/2) n for each codeword x G C. Then 
Lemma I2TT1 leads to fi(p) = 1/2, i = 1, • • • , re. 

For the statement 2, let us suppose it G C. Then, from the remark before this proposition, 
fi(u) = l(or resp.) if itj = l(or resp.) for each i. Hence f(u) = u. On the other hand, if 
it G F£ \ C, then H (it) = 0. It means that it is a pole and can not be a fixed point. 

For the statement 3, let us first note that u G S if and only if F x (u) = for any codeword 
x G C, because F x (u) > for u G I n . Therefore 5 C <9(i" n ), since Fa, (it) > for u G Int(/ n ). 

Let u G <9(/ n ) such that u^,--- ,Ui t G F2 and < itj < 1 for i ^ i%, ■ ■ • , ij. Then, if 
there exists a codeword x G C such that (:%,••• = (n^,--- , u^), then the value of its 

corresponding codeword polynomial is F x (u) > 0. So, u £ S. On the other hand, if there is no 
such codeword, then H(u) = 0, it means u G S. I 

From this proposition, the rational map (|1.2p has information of not only all codewords C as 
fixed points but also non-codewords F2 \ C as poles. We call these fixed points codeword fixed 
points. The following proposition shows that all of the codeword fixed points are stable. 

Proposition 2.3 Let a parity check matrix do not have zero column vectors, i.e., there exists 
no codeword with weight 1. Let u be a codeword fixed point. Then the Jacobi matrix of the 
rational map (jl.2p at u is the zero matrix. Hence, u is a stable fixed point. 



Proof. Let us denote the i-th element of the rational map (|1.2p by 

h{u) 



H(u) 



and denote the derivatives of Ij and H with respect to Uj by I? and H 3 , respectively, for 
the simplicity of notations. In what follows, we will also use these notations for higher order 
derivatives in a similar way (like H lJ = ^.q u . )■ Then the derivative dfi/duj(u) is given by 



= {li(u)H(u) - Ii(u)Hi(u)} /H\u) 



Since u G C, we have H(u) = 1. Let us consider the two cases ut = and m = 1, separately. 

For itj = 0, we have Ii(u) = 0. Similarly we have J| (it) = if i 7^ j. Hence, dfi/duj(u) = 
in this case. On the other hand, if i = j, from the assumption on the parity check matrix, we 
have no codeword u G C such that the only difference from u occurs at the i-th bit element, i.e., 
itj = itj for j ^ i but Uj 7^ U{. Therefore, Ij (it) = 0, and it again leads to dfi/duj(u) = 0. The 
case Ui = 1 can be proven by the similar way. I 

Let us next discuss properties of the fixed point p = (1/2, • • • , 1/2). 

Proposition 2.4 Let G = (g\ ■ ■ ■ g n ) be a generator matrix. Then, the Jacobi matrix J of the 
rational map (|1.2p at p is determined by 

Jij = 1, if 9i = 9j, 
Jij = 0, if gi ^ gj 



for all i, j = 1, • • • ,n. 
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For the proof of this proposition, we need the following lemma. 

Lemma 2.5 For i,j £ {1, • • • ,n} with i ^ j, let us consider the following subcodes of C 

C(^.=o) = {x e C | Xi = l,xj = 0} , C(Ij=|) = {x eC \ Xi = l,xj = 1} . 
Then the followings hold 

1. if 9i = 9j , then flC(^=£) = 0, jjC(£=i) = 2 k ~\ 

2. if 9i + g 3 , then ftCQ^) = $P£\z\) = 2 k ~ 2 . 



Proof. The statement is trivial when = gj, so we suppose gi ^ gj. In the following we adopt 
the mod 2 arithmetic for elements in F2. We can express gi, by using some permutations of rows 
if necessary, as follows 

9i = (l_^l 0---0) T . 

a 

In the following we only deal with the case a < k, but the modification to the case a = k is 
trivial. Now we have the following two cases 

case I: there exists / > a such that gij = 1, 
case H: not case I, i.e., gij = for all I > a. 

In the case I, let us fix (mi • • • m a ) in a message m = (mi • • • m^) with mi + • • • + m a = 1, 
which corresponds to Xi = 1, and consider the numbers of codewords with Xj = and Xj = 1 
from the remaining message bits (m Q+ i • • • m^). Then, from the assumption, there exists at 
least one non zero element in g a+ ij, ■ ■ ■ ,gk,j- Hence, the same argument in Lemma 12.11 shows 
that the numbers of codewords with Xj = and Xj = 1 under a fixed (mi, • • • ,m a ) are the 
same. By considering all the possibilities of (mi • • • m a ) with mi + • • • + m a = 1, it gives 

ttc(^) = ttcC;=i) = 2 fc " 2 . 

Next, let us consider the case H. Again by using a permutation if necessary, we have the 
following expressions 

where a = ot\ + 02 and «2 > because of gi 7^ gj. Then we have 

ttC(^=J) = ${m 1 + --- + m ax+a2 = 1 andmi + --- + m Q1 = 0} x 2 k ~ a 

= jj {m ai+ i H h m ai+a2 = 1 and mH h m ai = 0} x 2 k ~ a , 

ttCg'=l) = (t {mi + • • • + m ai+a2 = 1 and mi + • • • + m Ql = 1} x 2 k ~ a 

= (t H 1- rn ai +a 2 = and mi H h m Q , 1 = 1} x 2 fc_a . 

However, the same argument in Lemma |2 . 1 1 implies (JC(^ 1 Zq) = JJC^'z}) = 2 fc ~ 2 . I 



Proof of Proposition 2.4 The (i, j) element in the Jacobi matrix J is given by 



^(p) = {4(p)ff(p) - /i(p)^(p)} /H' 



(p). 
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From Lemma |2. 11 H J (p) = because 



£P(p) = - x2 fe - i - - x2 fe - 1 =0 



where the first term comes from the codewords with Xj = 1 and the second term comes from 
the codewords with Xj = 0. It is also easy to observe that H(p) = (l/2) n x 2 k = 2~ n+k , and 
I\(p) = (l/2) n_1 x 2 k ~ 1 = 2~ n+k . Therefore the diagonal elements are Ju = 1, i = 1, ■ • • , n. 

Next, let us consider the case i ^ j. In this case, if we have gi = gj, then, from Lemma 
[231 if 0) = (l/2) n ~ 1 x 2 fc ~ 1 = 2- n+k . On the other hand, if we have # ^ g j} then if (p) = 
(l/2) n ~ 1 x 2 k - 2 - (1/2)™- 1 x 2 k - 2 = 0. This concludes the proof of Proposition [Ml I 

Two corollaries follow from Proposition 12.41 which characterize the eigenvalues and the eigen- 
vectors of the Jacobi matrix J, and it clearly determines the stability and the stable/unstable 
subspaces of p. To this end, let us denote by Qj a graph whose adjacent matrix is J. Namely, 
the nodes of Qj are 1, ■ • ■ , n, and an undirected edge appears in Qj if and only if Jij = 1. 

Corollary 2.6 Suppose the graph Qj is decomposed into I connected components 

Gj = AiU---UAi, Ai D Aj = Q) if i^j. 

Let ni be the number of nodes in the component Ai, i = 1, ■ • ■ , I. Then all the eigenvalues of J 
are given by 

7ii,n 2) ■ ■ ■ ,ni,Q, 

where the eigenvalues n\, ■ ■ ■ ,n\ are simple and the eigenvalue has (ni + - • -+ni — l) multiplicity. 

Proof. From Proposition 12. 4[ any two nodes in a same connected component have an edge 
between them. Hence, it is possible to transform J into the following block diagonal matrix 



E~ X JE 



( B x • • • \ 
B 2 ••• 

V ■ ■ ■ B l j 



(2.1) 



where E is determined by compositions of column switching elementary matrices, and B{ is an 
ni x rij matrix all of whose elements are 1. The statement of the corollary follows immediately. 



From now on, we treat a Jacobi matrix of the block diagonal form ([2. 1 j) . Obviously, it gives 
no restriction since, if necessary, we can appropriately permute columns of the original generator 
matrix in advance. Let us denote the set of eigenvalues of J derived in Corollary 12.61 by 

ni,0,--- ,0,--- ,ni,0,--- ,0, -- ,n,,0,... ,0, (2.2) 

where the successive 0, • • • ,0 after each m has (ni — 1) elements. In case of n% = 1, we ignore 
the successive 0, • • • , for (i.e., rij, n^+i, 0, • • • ). 
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Corollary 2.7 The Jacobi matrix J is diagonalizable. Furthermore, the corresponding eigen- 
vectors 

J 1 ) ... D (i) ... _«... _(<) ... _(0 ... 
/or (|2.2p under this ordering are given by the following 











(*) 

Pi = 




,-,pg = 













where 



( 1 \ 



?1 



?2 



/ 1\ 

-1 



V 0/ 



(j) 

9 3 



1 

-1 



o(0 





1 

V -i / 



ifere the vectors q^' ■ ■ ■ q^} have n« elements and the first element starts at the (n\+- ■ -+nj_i+l) 

expresses t 
Hi = 1, we on/y Ziaue p^ (u>i£/z q± = (1)). 

Proof. It is obvious from Corollary! 



(i) (i) 

th row. The bold type expresses that the remaining elements in p\ , • • • ,p n [ are 0. In case of 



We finally mention a relationship to a continuous gradient dynamical system. Let us denote 
by Fx (u) a polynomial obtained by removing pi{ui) from a codeword polynomial F x {u) of 
x € C. By using this notation, the map (j 1 . 2 [) can be also described as 



K = fi(u) 



Then it follows that 



Ui - Ui 



Y^x&c F x (u) 

Xi = l 



J2xec UiFi % \u) 

Xi = l 



x »- u Xj=0 Xj=l 



H(u) 



\xi=l 



Xj=l 



Uj(l - Uj) 

H{u) 



( 



Xi=0 



= -Ui)— (logif(u)) . 

This proves the following proposition. 

Proposition 2.8 The rational map (jl.2p maps a point u £ Int(/ n ) to £/ie direction o/grad(log H(u)) 
with a contraction rate — Uj) /or eac/i element i = 1, ■ ■ ■ , n. Especially, u £ Int(/ n ) is a 
/izeci point o/ (|1.2p i/ and on/y i/ it is a /iied point o/ i/ie continuous gradient dynamical system 
du/dt = grad(log H(u)). 
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3 Generator Matrix and Taylor Expansion 

Next, we study a relationship between higher order terms in (|1.5p and a generator matrix G, 
and prove Theorem [TJ For this purpose, the key proposition is given as follows. 

Proposition 3.1 Let I > 2. If any distinct (Z + l) column vectors of G are linearly independent, 
then f^(v) = 0. 

Let us denote a higher order derivative of an i-th element /j with respect to variables 
, u k by 

For the proof of Proposition 13-H we need to study higher order derivatives H ll ' H (p) and 
It 1 H (p)- Let us at first focus on higher order derivatives H n '" n {p). We begin with the fol- 
lowing observation, which characterizes the numbers of subcodes by means of column vectors of 
G. It is noted that we adopt the mod 2 arithmetic for elements in F2. 

Lemma 3.2 Suppose i%, ■ ■ ■ ,ii S {1, • • • , re}, I > 2, are distinct indices, and let 

C(x il H h x i; = 0) = {x € C I Xi x H h x it = 0} , 

C(x h H h x it = 1) = {x € C I x h H h x i{ = 1} 

6e subcodes in C. Then the followings hold 

1- if9h+--- + 9h + 0, then \C{x ix + • • • + x if = 0) = ftC^ + • • • + x i; = 1) = 2 k ~ l . 

2- if 9h~\ h 5i ; = 0, i/ten flC^ H h x i{ = 0) = 2 fc and $C(x h H h x i{ = 1) = 0. 

Proof. By a suitable bit permutation, if necessary, the sum of , ■ ■ ■ , can be expressed as 

9h + --- + 9n = 0---0) T . 

as 

Then an original message (mi • • • m/t) and its codeword {x\ ■ ■ ■ x n ) satisfy the following 

x ix -\ \~x k = (mi • • • m k ) ■ (g h H \- g h ). 

In the case 1 (a ^ 0), it leads to x^ + • • • + Xi t = mi + • • ■ + m a , so the conclusion follows 
from the same argument in Lemma 12.11 The case 2 is trivial from the above expression of 

Xi l ~\~ ' ' ' ~\~ Xi l . I 

The following lemma classifies the value H n ' H (p) based on the column vectors of G. 
Lemma 3.3 Let I > 2. Then H iv " il (p) = if either of 
1. there exist same indices in i±, ■ ■ ■ ,i\ 

2- g h + ■ ■ ■ + 9n + 
is satisfied. Otherwise, that is ft, ■ • • , ii are all distinct and fl^ + • • • + flfy = 0, 

£pi"*i(p) = (-iy2~ n+k+l . 
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Proof. The condition 1 immediately implies H ll '" ll (p) = since the degree of each variable U{ 
in H{u) is 1. Hence we assume all the indices are distinct. 
Let us define the following subcodes 

:= {x 6 C | the number of in x\ x , • • • , x k is odd} 
ZT{ e \ '■= {x G C I the number of in x^ , • • • , x k is even}. 

Then H n '" H (p) can be expressed by 

H^(p) = {l/2) n - l {mi e X ~ K dd 0- 

Suppose gi x + • • ■ + g k ^ 0. Then, by Lemma [3T2| we have §C{xi x + • • • + x k =0) = ftCfe^ + 
• • • + Xi t = 1). On the other hand, when I is odd (or even, resp.), 

Z iv d ii = ^ h x i; = 0) (or = C(x il H hx,, = l),resp.), 

= C(x h + ■ ■ ■ + x k = 1) (or = Cfoi + • • • + x i( = 0), resp.). 

Therefore it concludes H n '" H (p) = 0. The statement for + • • • + g% t = is similarly derived 
from Lemma 13.21 I 

Next, we try to classify the value I^ 1 H (p)- 

Lemma 3.4 Suppose ,ii € {1, ■ • • , n},l > 2, are distinct indices and let us define two 

subcodes in C by 

C ^ ii+ x .!+i i;=0 ^ := {x e C | Xi = 1, Xjj H h ^ = 0} , 

C (xi 1 +--+Xi l =l) ■= {x (z C \ Xi = 1, Xij + • • • + = 1} . 
Then the following classification holds 

1. if Oh +••• + %= 0, fcen #C ( Xil+ ?'S 4l=0 ) = »C (^V^ =i) = 0. 

2. if0^ gil +--- + g H ^ g i; then jJC (s^S^o) = ttC (x 4l +* i .?i 4l =i) = 2 fc ~ 2 . 

5. i/0 ^ 9il + • • ■ + g k = g i; then JJC (^.^ =0) = 0, JJC (^+^=1) = 2*- 1 . 
Proof. Since we have 

x h -\ h x it = (mi • • • m k ) ■ (g h H h g k ), 

the case 1 and 3 are trivial. So, let us assume 7^ g^ + ■ ■ ■ + g k ^ g%. The proof is similar to 
that of Lemma 12.51 By using a suitable permutation, let us express g\ as follows 

gi = (l_^_L 0---0) T . 

a 

Here we only deal with the case a < k again, since the modification for a = k follows immediately 
from the following case H. We have two situations 
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case I: there exists f3 > a such that gp^ + ■ ■ ■ + gp^ = 1. 
case E: not case I, i.e., gp^ + • • • + gp^ = for all (3 > a. 

In case I, let us fix (m\ ■ ■ ■ m a ) with m\ + • • • + m a = 1, which corresponds to Xi = 1, and 
consider the numbers of codewords with Xi x + • • • + x^ = or = 1 for the remaining message 
bits (m a+ i ■ ■ ■ nik)- From the assumption, the /3-th element of the vector g% x + ■ ■ ■ + g% x is 1, 
and, by applying Lemma 13.21 to the subvector from (a + l)-th to k-th elements, the numbers of 
codewords with xi x + ■ ■ ■ + x^, = or = 1 are the same for each (mi ■ ■ ■ m a ). Hence we have 

JjC ^ Zii+ a: .!+^ ;= o^ = JJC ( Xii ^J.± Xii= i\ = 2 k ~ 2 . The proof for case II is almost parallel to that of 
Lemma 12.51 so we omit it. I 

Let us introduce the following notations, which are similar to those in the proof of Lemma 



Zf 1 dd i[ (xi = 1) = {x £ C | the number of in x^, ■ ■ ■ , x^ is odd and Xj = 1}, 
(xi = 1) = {x & C | the number of in x^, ■ ■ ■ , x^ is even and Xj = 1}. 

Then, for odd I (or even I, resp.), we have 

Z° dd H (x t = 1)=C (, n ^ i=0 ) (or = C ( Xil+ x t.+l il= i) , resp.), 
Z^( Xi = I) = C ( Xii+ x *? Xii=1 ) (or =c( Xii+ x ?.? Xi - ),re S p.). 

Lemma 3.5 Let I > 2. Then J* 1 H (p) is classified as 

0, If CO or CI 

L?~ i i(p) = { (_i)2+i 2 -"+fe+'-l ! IfC2 
r ^-n+k+l-l If C3 



where each condition is given by 

there exist same indices in - ■ ■ ,ii 



CO 
CI 
C2 
C3 



CO and ^ g h + ■■■ + g k ^ gi (here CO means "NOT CO"; 

CO and g ix H h g k = gi 

CO and g h H h =0 



Proof. The condition CO immediately implies I? 1 '" '(p) = 0. Hence we assume all the indices 
are distinct. The remaining proof follows directly from Lemma [3.41 for each case. First of all, let 
us study the case i £ ■ ■ ■ , i{\. By using the notations introduced before the lemma, we have 

C"» = (l/2) B -'(tl^ (x, = 1) - = !))• 



Therefore the condition CI implies 1^" l (p) = by Lemma [37 

On the other hand, if we assume the condition C2, then %C (^ Xii + X -+x i[ =o) = and JjC (^ Xii +.?^ l Xi 
2 fc-l from LemmaEai If I is even, = -(1/2)"-' x 2 fc ~ 1 = _2- n + fc+/ - 1 . Similarly, if / is 

odd, lt v " il (p) = 2- n+k+l ~ 1 . Hence, we have I- 1 '"^(p) = (-l)'+ 1 2- n+ *+ 1 - 1 for the condition C2. 
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For the condition C3, the role of jjC (x^-h'+xi^o) an d W- +^=1) changes each other 

from Lemma 13.41 so it just leads to the opposite sign in J* 1 H {p) to that for the condition C2. 

Next, let us study the case i € {ii, ■ ■ ■ , ii}. Without loss of generality, let us suppose i = i[. 
Then we have 

ir^ip) = {l^m^-M = i) - »^ d 4-x(^ = i))- 

Therefore the condition CI implies I 1 - 1 %l {p) = by Lemma 13, 41 (or Lemma 12.51 for 1 = 2). 

For the condition C2 (I must be > 2), we have (JC ^z^+.^+z* 1= o) = 2 fc ~ 1 and \C ( Xil +-^+xlj 1 = 

from Lemma 13.41 Hence, by the same calculation as that for i £ {ii,-- - ,ii}, we have 
I^" il (p) = (_ 1 )«+i 2 -™+fe+;-i. 

Finally, let us consider the condition C3. For I > 2, we have ftC ^z^+.^+a;* x =oj = and 



we 



t(C L <i+ .. a: 4.^_ i= iJ = 2 fc - 1 by Lemma [33] again. So, 7j r " t! (p) = (-l) I 2- n+ *+'- 1 . For / = 2, 
can not use Lemma 13.41 because of / — 1 = 1 < 2. However, Xj = holds from the assumption 
gi = gi x . Hence a direct calculation shows Il ll2 {p) = (l/2) n ~ 2 x 2 fc ~ 1 = 2~ n+fe+1 , which is the 
formula for I = 2. I 

Before proving Proposition 13-H let us show the following two lemmas. The proofs of them 
are easy application of induction. 

Lemma 3.6 The derivative (1/H) 11 "" 11 is given by 

= i-^JniH' 1 ■ ■ - Hil ) + ■ ■ ■ + (-^iSt E ( Hrl ■ ■ ■ Rrk ) + ■ - w Hil " il ' 

J C(l,k) 

where the summation for the k-th term (1 < k < I) is taken on all combinations C(l,k) for 
dividing i\, ■ ■ ■ ,ii into k groups. Here r 1 , • • • , r k represent a decomposition of ■ • • , i{\; 

{*!,... ,i l } = ul 1 r\ rV0, rW = (i ^ j). 

Lemma 3.7 Let I > 2. Then the derivative j? 1 is given by 

^_^(i) + ... + |*(')V. + *(±) \ 

where Pj and are a decomposition of • • • , ii}; 

p} n q^ fc = 0, pj U qj = {h , ■ ■ ■ , *,}, ^ = k, 
and the summations are taken on all the combinations of the decompositions. 
Now we prove Proposition 13.11 

Proof of Proposition HOI Let us assume that any distinct (I + 1) column vectors of G are 
linearly independent. Then, from Lemma 13.31 13.61 and 13.71 we have 



1 
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On the other hand, from Lemma 13.51 I 1 ^ " l (p) is 0. The proof is completed. I 



Finally, we are in the position to prove Theorem HJ 

Proof of Theorem [TJ The formula for the case I = 2 is given by Proposition 12.41 Let us assume 
I > 3. From Proposition [3J3 all nonlinear terms with orders less than I are zero, so we only study 
the l-th nonlinear terms. From the assumption and similar argument in the proof of Proposition 
13-H the derivative /* : L " H (p) is given by 



i 



Hip) 



Then, the classification in Lemma [331 shows that n (p) = in CO and CI. Moreover, the 
condition C3 does not occur from the assumption. Hence, only nonzero terms are derived from 
the condition C2. It should be noted that the set of indices ■ ■ ■ ,ii) satisfying the condition 
C2 with i is exactly the same as 0^ • Hence, Lemma 13.51 results in 

ff ii (p) = i i r i \p) f 1 



H(p) 



(1/2)" x 2 k 



for (ii,--- ,ii) E 0v . Finally, we have the following 



, m l . . , v m„ 



miH hm n =l 

m k >0 



The identity in fll .Tf) is derived by just substituting V{ = Ui — 1/2. It completes the proof. 



4 Relationship to Algebraic Geometry Codes 

The purpose of this section is to derive Corollary |4.51 This corollary shows that usual techniques 
in algebraic geometry codes can be applied to control, not only the minimum distance of the 
code, but also the approximate ML decoding. For the definitions of basic tools in algebraic 
geometry such as genus, divisor, Riemann-Roch space, differential, and residue, we refer to pQ. 
We also request basic knowledge of algebraic geometry codes in this section (e.g., see [1], [8] and 

Let F q be a finite field with q > 1 elements. Let us first recall two classes of algebraic 
geometry codes called geometric Reed-Solomon codes and geometric Goppa codes. Let X be an 
absolutely irreducible nonsingular projective curve over ¥ q . For rational points Pi,-- - , Pa on 
X, we define a divisor on X by D = P\ + • • • + P^. Moreover, let D' be another divisor whose 
support is disjoint to D. We assume that D' satisfies the following condition 

2g - 2 < deg(D') < n 
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for the sake of simplicity. Here g is the genus of X. Geometric Reed-Solomon codes are charac- 
terized by the Riemann-Roch space associated to D 1 

C(D') := {4> G F*(X) | (0) + D' > 0} U {0}, 

where ¥*(X) is the set of nonzero elements of the function field ¥ q (X), and (</>) is the principal 
divisor of the rational function <fi. The fundamental fact that the Riemann-Roch space C(D') is 
a finite dimensional vector space leads to the following definition. 

Definition 4.1 The geometric Reed-Solomon code C(D,D') of length n over ¥ q is defined by 
the image of the linear map a : C(D') — > ¥ q given by a(<p) = (4>(Pi), • • • , 4>(Pn)). 

On the other hand, geometric Goppa codes are defined via differentials and their residues. Let 
us denote the set of differentials on X by Q(X), and define for each divisor D 

Q(D) := {co G Q(X) | (w) - D > 0}, 

where (u) is the divisor of the differential oo. 

Definition 4.2 The geometric Goppa code C*(D,D') of length h over ¥ q is defined by the 
image of the linear map a* : Q(D' — D) — >• F™ given by a*(oj) = (Respj (u), ■ ■ ■ ,Resp-(w)), 
where Resp(w) expresses the residue of w at P. 

The following propositions are an easy consequence of the Riemann-Roch theorem. 

Proposition 4.3 (e.g., [4], [8], [9]) 

1. The dimension of the geometric Reed-Solomon code C(D,D') is k = deg(D') — g + 1 and 
the minimum distance satisfies d > n — deg(D'). 

2. The dimension of the geometric Goppa code C*(D, D') is k = h — deg(D') + g — 1 and the 
minimum distance satisfies d > deg(D') — 2g + 2. 

3. The codes C(D,D') and C*(D, D') are dual codes. 

It should be noted that the minimum distances of these two codes are controlled by the genus of 
X and the choice of the divisor D', and, as a result, they induce appropriate linear independence 
on their parity check matrices. 

For an application of Theorem [U we need to derive expanded codes over F2 from geometric 
Reed-Solomon and geometric Goppa codes over ¥ q ,q = 2 s . Let C q be a code over F g with length 
n, and ex, ■ ■ ■ , e s G ¥ q be a basis of F2-vector space ¥ q . This basis naturally induces the map 
F™ 9 x h-> x G F - 2 S by expressing each element xt in x = (xi, • • • , Xn) as coefficients of F2-vector 
space. Then the expanded code of C q over F2 is defined by C2 := {x \ x G C q }. A relationship 
between C q and C2 is given by the following proposition. 

Proposition 4.4 // a code C q has parameters [h,k,d], where h is the code length, k is the 
dimension, and d is the minimum distance, then its expanded code Ci has the parameters 
[hs, ks, d! > d\. 

Now we apply Theorem [1] to geometric Reed-Solomon/Goppa codes. Let n = hs, and let 
C2(D, D') and C^D^D') be the expanded codes over F2 of a geometric Reed-Solomon code 
C(D, D') and a geometric Goppa code C*(D, D') over ¥ q . Then, we have the following corollary. 
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Corollary 4.5 The expanded geometric Reed-Solomon code C2(D,D') has the minimum dis- 
tance d > h — deg(D'). Furthermore, there exists an l-th order approximate ML decoding with 
I > deg(D') -2g + l such that / f («) = m + ff\v). 

Proof. The first statement is the property of a geometric Reed-Solomon code and its expansion. 
The second statement follows from Theorem [1] and the duality of C(D,D') and C*(D,D'). I 



Example: Hermitian Code (e.g., [I], [8], [9]) 



Let q 



be a power of 2. The Hermitian curve T~L is given by the homogeneous equation 



X r+ + Y r+1 + Z r+1 = and its genus is g = r(r — l)/2, because there are no singular points. 
It is known that the number of rational points over ¥ q is r 3 + 1. 

Let us fix r = 2 as an example. Then, the following is the list of the rational points on T~i: 

Pi = (1,0,0), P 2 (1,0,^), P 3 (L0,1), P 4 = (1,0,0), P 5 (l,w,0), 

P 6 (M,0), P 7 = (0,0,1), P 8 (0,w,l), Q = (0,1,1), 

where cj is a primitive element of F4 and = 1 + cj (i.e., F4 = {0,1, u>,0}). Let us suppose 
D = Pi + • • • + Ps (hence the code length is n = 8), and D' = mQ, 2g — 2 < m < h. A basis of 
the Riemann-Roch space C(D') with m = 4 is given by 

f X Y X^ 
C{AQ) = Span 1 1, 7T ^, ^- z? 

Then, we can explicitly show a generator matrix of the expanded geometric Reed-Solomon code 
C(D,D') over F2 as follows 

/100000001 



G 



010000000 
001000001 



1 
1 



1 





(4.1) 



000000101 

V000000010 



1 
1 



1 1 1 1 \ 
10 11110 

1 1 

1 1 1 
10 10 10 

1 
1 

10 110 11/ 

It should be noted that this case leads to a self-dual code C(D,D') = C*(D, D'). Hence, its 
parity check matrix H is the same as G, and a direct calculation proves that the minimum 
distance of this code is 4. It means that we have the 3-rd order ML decoding in Corollary 14.51 
The explicit forms of fi(u),i = 1, • • • ,16, are given as 

fi(u) =ui+ Av 7 (v 3 v 9 + v 5 v n + vi 3 v 15 ), f 2 (u) 



f 3 (u) = U 3 + 4v 9 (viV 7 + V 5 Vu + V13V15), U(u) = 

f 5 (u) = U 5 +Av 11 (ViV 7 + V 3 V 9 +V 13 V 15 ), f 6 (u) = 

f 7 (u) = u 7 + 4-ui(u 3 V9 + v 5 vn + V13V15), f 8 (u) = 

f 9 (u) = u 9 + Av 3 (viv 7 + v 5 vn + vi 3 v 15 ), fio(u) 

fn(u) = tin + 4l> 5 (?;i7J7 + v 3 v 9 + V13U15), f\ 2 {u) 

f 13 (u) = ui 3 + <iv 15 (viv 7 + v 3 v 9 + v 5 vn), fu(u) 

fis(u) = wis + <±v 13 (viv 7 + v 3 v 9 + v 5 vn), fw(u) 
where v j = ui — 1/2, i = 1, • • • ,16. 



u 2 + 4u 8 (v 4 «io + v e v 12 + vuvie), 
u 4 + <iv w (v 2 v 8 + v e v 12 + vuviq), 

Uq + <ivi 2 (v 2 V8 + V4V10 + VuVxq), 

u 8 + 4v 2 (w 4 wio + v 6 vi 2 + vuvw), 

= U10 + 4u 4 (i;2V8 + VqVi 2 + VuViq), 
= U12 + 4l> 6 (7J 2 t>8 + 4^4^10 + «14«16) 

= ni4 + 4v 16 (v 2 v 8 + v^vio + «6«12), 
= iti6 + 4i>i 4 (t;2t;8 + v 4 uio + U6V12), 
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5 Discussions 



To conclude this paper, we address the following comments and discussions, some of which will 
be important for designs of good practical error-correcting codes. 

5.1 Stability 

Codeword fixed points in C are stable from Proposition 12. 3\ and non-codeword poles in F^ \ C 
are unstable from Proposition 12.81 in the sense that nearby points to a pole in FJ? \ C leave 
away from the pole. Let us recall that each n-bit received sequence y G Fj and its initial point 
u° £ I n are related by (II. 3D and it is characterized that y is the closest point to u° in (dl) n . 
Hence, if the received sequence y is a codeword, then u° may approach to the codeword fixed 
point y. This obviously depends on whether or not u° is located in the attractor region of the 
codeword fixed point, although it is actually the case when e is small enough because of the 
stability. So far, a general structure of the attractor region for each codeword fixed point is not 
yet known. However this is an important subject since it is indispensable to give an estimate 
of error probabilities of ML decoding and its approximation. Similar arguments also hold for a 
non-codeword received sequence and its repelling property. 

5.2 Local dynamics around p 

Let us recall that the ML decoding rule is given by (ll.4jl which checks the location of the image 
for an initial point u° to the point p. Hence the local dynamics around the point p will be 
important for decoding process. In the following, we explain the local dynamics around p in two 
different cases: the Jacobi matrix J at p is (i) identity or (ii) not identity. 

In the case (i), the local dynamics around p is precisely determined by Theorem [TJ As 
explained after Theorem [H the nonlinear dynamics around p is closely related to the encoding 
structure of the code and the decoding process. 

In the case (ii) , let us suppose that Bi in Corollary 12.61 induces unstable eigenvalues rii > 1 
and let us focus on its stable/unstable eigenspaces Si/Ui, respectively. From its eigenvector, Ui 
is given by the 1 dimensional subspace spanned by 



This plays a role to make , ■ ■ ■ , Ui n . to be equal, and it reflects the fact = ■ ■ ■ = gi n . . It 
means that the unstable subspace U points to codewords' directions, i.e., C. On the other hand, 
Si is spanned by the stable eigenvectors p 2 , •• • ,Pn( in Corollary 12.71 Contrary to the unstable 
eigenvector, these eigenvectors play a role to generate different elements in ,Uj n . and 

point to non-codewords' directions under time reversal, i.e., FJ? \C. Therefore, the fixed point p 
can be regarded as an indicator to codewords in the sense that non-codeword elements shrink 
and codeword elements expand around p. 

In both cases, further nonlinear analysis of center/stable/unstable manifolds of p will be 
useful for finding suitable encoding rules and estimating the decoding performance for the ap- 
proximate ML. 



((),••• ,0,1,..- 




1,0 



••• ,0). 
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5.3 Hyperbolicity of p and rate restriction 

From the above argument on the fixed point p, it seems to be appropriate to design a generator 
matrix to be hyperbolic at p, because p separates expanding and shrinking directions properly 
and these separations have an affect on the decoding performance. However, if p is hyperbolic, 
then the coding rate must satisfy r = k/n < 1/2 by Corollary 12,61 Namely, the hyperbolicity 
prevents a code to have a high coding rate greater than half, although this is not a strict restric- 
tion in particular applications like wireless communication channels. Therefore it is necessary 
to have a center eigenspace at p for a code with the rate r > 1/2. 

5.4 Normal form theory in dynamical systems 

The normal form theory in dynamical systems (e.g., see [TO]) enables us to transform a map 
into a simpler form by using a near identity transformation around a fixed point. One of the 
essential points is that nonresonant higher order terms can be removed from the original map 
by this transformation. Theorem [1] can be interpreted from the viewpoints of normal forms in 
such a way that an algebraic geometry code gives only zero nonresonant terms in the expansion 
of f(u) at p. Then, it leads to the following natural question whether a code whose rational 
map does not have resonant terms, but has nonresonant terms which are not necessarily zeros in 
its expansion is a good error-correcting code or not through a near identity transformation. At 
least, this class of codes contains algebraic geometry codes as a subclass, and a similar statement 
to Corollary 14.51 holds through near identity transformations. 

5.5 Relation to LDPC codes 

It seems to be valuable to mention a relationship to LDPC codes [3], [5], which are a relatively 
new class of error-correcting codes based on iterative decoding schemes (for a reference to this 
research region, see [6]). The iterative decoding schemes mainly use so-called sum-product al- 
gorithm for ML decoding and deal with a marginalized conditional probability in (jl.ip as a 
convergent point. Although this coding scheme gives a good performance in some numerical 
simulations, mathematical further understanding of the sum-product algorithm and ML decod- 
ing is desired to design better coding schemes. From the viewpoint of dynamical systems, it 
seems to be natural to formulate the sum-product algorithm or ML decoding itself as a certain 
map, and then analyze its mechanism. The strategy in this paper is based on this consideration. 
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